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ABSTRACT 

In this paper we present the guideUnes for an extrasolar planet taxonomy. The discovery 
of an increasing number of extrasolar planets showing a vast variety of planetary parameters, 
like Keplerian orbital elements and environmental parameters, like stellar masses, spectral types, 
metallicity etc., prompts the development of a planetary taxonomy. In this work via principal 
component analysis followed by hierarchical clustering analysis, we report the definition of five 
robust groups of planets. We also discuss the physical relevance of such analysis, which may 
provide a valid basis for disentangling the role of the several physical parameters involved in the 
processes of planet formation and subsequent evolution. For instance, we were able to divide the 
hot Jupiters into two main groups on the basis of their stellar masses and metallicities. Moreover, 
for some groups, we find strong correlations between metallicity, semi-major axis and eccentricity. 
The implications of these findings are discussed. 

Subject headings: planetary systems — planetary systems: formation — planets and satellites: general 



1. Introduction 

With the discovery of the first extrasolar planet 
(EP) orbiting a solar-like star ( Mavor fc Queloj 
19951 ) a new field of astronomical research started. 
So far, more than 200 extrasolar planets have 
been discovered. Despite severe observational bi- 
ases -only partially overcome by the development 
of more sophisticated techniques and facilities- 
the most striking fact is the vast variety of EPs, 
and the remarkable difference with respect to the 
planets of our Solar System. Traditional theories 
of planetary formation have been seriously chal- 
lenged by these discoveries, and there are many 
aspects that still need to be understood in de- 
tail, like planetary migrations (disk-embedded vs 
planet-planet interactions) and the infiuence of 
the metallicy on the planetary formation pro- 
cesses, just to mention some. The heterogene- 
ity of EPs has been analyzed in several papers, 
and so me important trends have been uncovered 
e.g. seelZucker fc Mazehll2002l:ISantos et al.l2003t 



Udry et al.ll2003HEggenberger et al.ll2004 ISozzet'ti 
2004l:lFischer fc Valentill2005l ) . On the other hand. 



planetary formation is a very complex process, 
where a number of parameters have important ef- 
fects, as shown by several theoretical works. It 
is also likely that some parameters act simultane- 
ously in a complex way, thus motivating the need 
for a multidimensional approach going beyond the 
search for simple correlations among individual 
parameters. In other words, the planetary forma- 
tion process occurs in a multidimensional space. 
In addition to Keplerian orbital elements and stel- 
lar properties, other relevant parameters may be 
discovered in the future, as the formation theories 
improve. 

We adopt a multivariate approach to EPs in order 
to uncover underlying trends which may provide 
important information about the planet formation 
processes. The first step is the development of a 
robust taxonomy for EPs. Taxonomy, just as in 
other fields of research, may be a precious tool in 
defining clusters of EPs, which in turn may high- 
light differences in the formation processes and 
subsequent evolution. 

This will be of particular importance with the in- 
creasing number of EP discoveries which is ex- 
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pected to occur in the near future thanks to sev- 
eral space missions (e.g. CoRoT and Kepler) and 
ground-based surveys. For the moment, the num- 
ber of EPs is about 200, which may prevent firm 
statistical conclusions. Nevertheless, here we pro- 
pose the guidelines for an EP taxonomy, to be re- 
fined as more data become available. 
In the next section (2), we describe the parame- 
ters used for this study and we perform a statis- 
tical approach of the data for dimensional scaling 
purpose. Then, in section (3), cluster analysis is 
performed. We caution from the beginning that 
these analyses -much like all statistical analyses- 
are not unique, since several different criteria may 
be used for characterizing the data. There is no a 
priory best criterion: the choice can indeed vary 
according to the kind of data under analysis. We 
shall provide a step-by-step justification for all the 
choices made. Finally, in section (4), the solution 
will be discussed along with its physical interpre- 
tation. 

2. Multivariate analysis and dimensional 
scaling of EPs 

The inputs to our model are the elements pro- 
vided by the interactive extrasolar planets catalog 
mantained by J. Schneidei0. These are: planetary 
projected mass (Mp), orbital period (P), semi- 
major axis (a), eccentricity (e), inclination (i), 
stellar mass (Ms), and stellar metallicity ([Fe/H]). 
Other possibly important parameters, like stellar 
spectral type and stellar age, have not been consid- 
ered at this stage, but will be the subject of further 
refinements. Only objects having simultaneously 
estimates for {Alp, P, a,e, Ms,[Fe/B]} have been 
used. Notice that the period and semi-major axis 
are obviously correlated, thus only one of them is 
used (see the following). We refer to them as the 
input variable^. Therefore, each planet can be re- 
garded as a point in a 5-fold space. The following 
analysis has been carried out in the IDL language, 
and it is structured so as to be easily updatable 
as more data become available. 



We consider 183 EPs (updated at 8 November 
2006). To them, the Solar System planet Jupiter 
has been added. This has been done because 
Jupiter-like bodies are approaching the observabil- 
ity limit in extrasolar planetary systems thanks 
to recent improvement in the surveys, and also 
to have a direct comparison with our own Solar 
System. We decided not to add the other Solar 
System planets as they are still well below the 
detection limits. The first step is to perform a 
statistical analysis in order to find out if there are 
useless -or less significant- input variables. This is 
done using principal component analysis (PC A). 
As this is a standard technique in multivariate 
data analysis, details will not b e discussed here 
(inter ested readers may consult lEveritt fc Dunn 
20011 ). The basic idea of PGA is to combine the 
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■^We chose to use simply Mp and a instead of performing 
some transformation of the data prior to the statistical 
analysis, as done by some authors which made recourse to 
logarithms, because there is no strong physical reason for 
that. Anyway, we plan to exploit this possibility in further 
works. 



input variables in such a way as to show those 
of most importance. This is done by describing 
the data with a number of new variables pci...pci 
for 1 — 5, ordered in terms of decreasing vari- 
ance. The pCi are chosen in such a way as to 
be uncorrelated with each other. This is done in 
practice by building an n x I input data matrix 
(where n is the number of planets). From this 
matrix, the I x / correlation or covariance ma- 
trix is computed, where the correlation matrix is 
more appropriate whenever the input variables are 
measured with disparate units. The pCi are eigen- 
vectors for those matrices. Considering that input 
variables have different units and that some span 
orders of magnitudes (e.g. Mp) while others span 
a limited range (e.g. Ms), we opted to perform 
variable standardization of the input variables, i.e. 
to scale input variables in such a way as to obtain 
a mean and variance of 1. This has the advantage 
of allowing the use of the covariance matrix, for 
which the eigenvalues (Ai) represent the portion of 
the variance of the original data which the corre- 
sponding eigenvectors {pCi) account for. PCA can 
thus provide a useful means for finding variables 
of little significance. On the basis of the variance 
attained by each pci we may reject some of them. 
This procedure has the advantage of using only 
the variables which are important, allowing a sim- 
pler description of the data set with only a minor 
loss of information. 

As a first step, we used PCA to decide whether 
the period or the semi-major axis is more suit- 
able for the statistical analysis (see also §3.2 on 
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this point). PC A was then performed on the 
{Mp, P,e,M„ [Fe/H]} and {Mp, a, e, M„[Fe/H]} 
spaces. The choice was made by requiring that 
the variance of the resulting pci is more concen- 
trated in the first principal components. The 5- 
fold {Aip,a, e, M, [Fe/H]} space turned out to be 
slightly more appropriate. Now, no commonly 
accepted rules exist for deciding which variables 
-if any- can be safely thrown away. One criterion 
suggests keeping variables which account for 70 - 
90% of the total variance (|Everitt fc Dunnll200l[) . 
Thus it seems reasonable to keep only the first 
three principal components which account for 73% 
of the total variance. The fact that we can use 
only 3 principal components allows a graphical 
representation and a better control of the results. 
Moreover, dimensional scaling has the effect of 
simplifying the clustering analysis. This is why 
we employed PC A, but we are aware it is not 
strictly necessary for the development of a taxon- 
omy. In fig. [T]the first three principal components 
are shown. 

To the accepted levels of variance we may write 
the following decomposition formulae: 



pci = -0.206 -(Mp- 2.64) -0.543 -(a -1.19) 

3.417 • (e - 0.25) - 1.823 • {M, ~ 1-03) + 

0.067- ([Fe/H] - 0.1) 
pc2 = -0.083 -(Mp- 2.64) - 0.030 -(a -1.19)- 

0.335 ■ (e - 0.25) + 3.080 • (M^ - 1.03) + 

3.921 ■ ([Fe/H] - 0.1) 
pc3 = 0.090- (Mp - 2.64) - 0.329- (a- 1.19) -f 

1.795 - (e - 0.25) - 1.748 - (M^ - 1.03) + 

2.063 - ([Fc/H] - 0.1) 

where each pci is expressed in terms of a lin- 
ear combination of all the input variables. No- 
tice that [Fe/H] has little influence on pci, while a 
and e basically do not contribute to pc2 (consider 
that input variables span different ranges, so for 
instance, the term containing e in equ. 2 has lower 
importance than the term involving Mp). These 
formulae may be used to add new planets into the 
database, without repeating the PCA (this is use- 
ful when we want to add only single objects which 
do not alter the overall statistics, otherwise PCA 
has to be repeated). 

Finally notice that we also checked for the pres- 



ence of overall correlations within the database. 
We found significant correlation (see also below) 



between Mp — e, a — e and Alp 



However, over- 



all correlations have been widely anal yzed elsewere 
and will be not discussed here (e.g. ISantos et al. 
2005, and references therein). 

3. Cluster analsysis 

Cluster analysis is a standard technique used 
in a variety of research fields from social sciences 
to geology, engineering and so on. The main pur- 
pose of such analysis is to find clusters in a given 
dataset such that elements belonging to the same 
cluster have a certain degree of homogeneity, while 
elements of different clusters have to be as differ- 
ent as possible. As with PCA, there is no unique 
way of performing the analysis and a number of 
clustering algorithms arc available. The choice of 
the clustering technique is quite arbitrary and it 
relies mostly on the kind of description of the data 
we are interested in. When the number of clusters 
is not known a priory, as in our case, hierarchi- 
cal clusteri ng is more suitable , thus we adopt this 
technique (jEveritt et al.ll200ir ). The major advan- 
tage of this technique is that it provides a classi- 
fication which consists of a series of nested parti- 
tions, which go from a single cluster containing all 
objects to n clusters each containing a single ob- 
ject. This process is based on a proximity parame- 
ter, usually a measure of distance among objects. 
Moreover, this classification may be represented 
by a two-dimensional diagram known as a dendro- 
gram, which illustrates the nested nature of the 
hierarchical partition. However, there are a num- 
ber of possible ways to perform the analysis, and 
an accurate step-by-step evaluation of the process 
has to be performed. In the following we refer to 
the variables used for the clustering (namely the 
{pCi}) as clustering variables. The general guide- 
lines for hierarchical cluste r analysis are the f ol- 
lowing (for more details see lEveritt et al.ll200l[ ): 



• standardization of the clustering variables; 

• computation of the proximity matrix, that 
is, evaluate the degree of proximity among 
dataset members. This is usually done by 
computing the distance in the clustering 
space (in our case the 3-fold {pci,pc2,pc'i\ 
space). Notice that several metrics may be 
used for this purpose; 
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definition of inter-group proximity measures. 
These specify the method used to quantify 
the proximity level of two clusters. Several 
methods are available, like single linkage and 
centroid linkage; 

computation of the distortion of the dendro- 
gram. The dendrogram itself is a represen- 
tation of the original data, where the sep- 
aration of two members is specified by the 
minimum distance between the two clusters 
which contain the objects. Thus the quality 
of a dendrogram can be evaluated by com- 
paring the original distance (i.e. in the clus- 
tering space) of members with the distance 
assigned to them by the dendrogram. This 
is formally done by the so-called cophenetic 
coejjicien}^ (c). It normally ranges from 0.6 
to 0.95, where higher values correspond to a 
lower distortion; 

definition of the best partition. In standard 
analysis this means stopping the hierarchy at 
a given distance; in other words to cut a den- 
drogram at a particular distance (or height, 
h). Again, no unique rule exists to define 
the best cut. Here we follow the procedure 
(|l977l ). the so-called 



suggested by iMoiena 



upper tail rule. In detail, we first evaluate 
the distribution of the heights N{h), then 
the best cut height is determined by 



hy = h 



7 • 



where 7 is a coefficient which ranges from 
1.25 to 3.5 (iMilhgan fc Coopeil Il985l) . h is 
the mean of the height distribution and ah 
its standard deviation. A possible way to 
define the best 7 is to find the values of 7 
for which the maximum separation between 
two consecutive solutions is achieved; 



^To compute c proceed as follows. First, compute the dis- 
tance between two planets (indicated by indices i and j) 
assigned by the dendrogram. Let dij be this distance, and 
d the average value of all dij. Let rij be the real distance 
between planet i and j, and r the average value of all rij. 
Thus: 

I2^Y.j>i{r^J -r){dij -d) 
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• testing of the quality of the solution against 
the absence of clusters. 

3.1. Finding the best solution 

First we decided not to standardize the clus- 
tering variables as this may reduce the difference 
among members, making the identification of clus- 
ters more difficult. Moreover, as a general rule, 
the same metrics should be used for the proximity 
matrix and the inter-group proximity measures. 
We explored different metrics and the effects of 
different algorithms of inter-group merging. The 
metrics used are: Euclidean, City Block, Cheby- 
shev, and correlative (based on the Pearson cor- 
relation coefficient). The inter-group algorithms 
considered are: single linkage, complete linkage, 
weighted pairwise average and weighted centroid. 
The full set of possibilities has been investigated 
by using the cophenetic coefficient and analyzing 
the corresponding dendrograms, c ranged from 
about 0.55 to 0.84. Although useful, the cophe- 
netic coefficient cannot be the only parameter for 
finding the best solution; very different hierarchies 
may have the same c. Here, we also considered 
other features in order to define the best solution. 
Among them, we first select the dendrograms hav- 
ing N{h) distributions with the lowest variance, 
i.e. those more peaked for low heights. Then, we 
required the tails of the N{h) distributions have 
to be very "discrete" for large heights, that is the 
solutions with k and k + 1 clusters (for low values 
of k) had to be well separated. Both conditions 
help to define a robust solution which is closely 
nested for small heights and stable against errors 
(e.g. observational errors) on the position of the 
EPs in the clustering space. 

Taking into account these constraints, the best so- 
lution was obtained with the Pearson correlation 
distanceQ and weighted centroid merging. This 
solution corresponds to c = 0.83. According to 
the upper tail rules, the best cut produces five ro- 
bust clusters. Remarkably, the interval of heights 
for which the solution is stable corresponds to 7% 
of the maximum height (notice that the average 
separation between two consecutive clusters is of 
the order of the maximum height divided by n, or 
~ 0.5%). The corresponding best solution dendro- 



*The distance between two items i,j is defined as dij = 
[(l — (f>)/2]^/'^, where (f> is the Pearson correlation coefficient. 
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gram is shown in figure [2] The dendrogram is well 
structured: for low heights the clusters are closely 
nested (i.e. small variation of the height results 
in a significant change in the number of clusters); 
while for higher values the clusters are well sepa- 
rated. 

It is interesting to note that traditional metrics 
(like the Euclidean) and traditional cluster merg- 
ing (like single linkage) produce, in general, bad 
results. It turns out that they are not able to 
find structures for EPs. This is in agreement with 
the analysis of Jiang et~al\ ( 2006f l , although it was 
performed with a very different approach to that 
adopted here. The main problem of these hierar- 
chies is that either they produce just one or two 
big clusters and some outliers, or the solution is so 
nested that the definition of cut-off is too arbitrary 
and meaningless. 

3.2. Testing the solution 

Before analyzing in detail the best solution 
found, we have to check for its robustness against 
the absence of clusters. We recall that cluster 
analysis will always produce a solution. The test 
for the absence of structure is done by an accurate 
evaluation of the intra-cluster separations. In gen- 
eral, unless we are dealing with data with very well 
separated clusters, the clusters will tend to have 
some degree of overlap. The overlap between two 
clusters can be estimated by comparing the dis- 
tance of two clusters with respect to their volume 
in the clustering space. In practice, we estimate 
the center of each cluster (ci), and its radius (r^), 
defined as the sphere which contains a given per- 
centage of the cluster members (both cl and are 
computed with a Euclidean metric) . Thus the de- 
gree of overlap between clusters i and j is set to: 
'^ij — + — CjW- Where for < 1 the 

clusters i,j are well separated. The overall overlap 
of the solution is defined as: 



UJ UJ.i, 

i>j 



if 



> 1. 



Higher values of uj imply larger overlap. Notice 
that a; is a cumulative parameter and is sensitive 
to the number of single intra-cluster overlaps, as 
it should be. To clarify this point, imagine a so- 
lution with a given degree of overlap between two 
clusters, and another one with the same degree 



of overlap but among three clusters. The aver- 
age degree of overlap will be the same, but the 
UJ of the second solution will be higher than that 
of the first. Indeed the first solution is better 
than the second since only two clusters are par- 
tially merged. For the best solution, we obtained 
UJ — 12.3. One way of testing the absence of clus- 
ters is with Monte Carlo simulations. We then 
performed a number of simulations (with 10^ so- 
lutions each), generating n random points in the 
clustering space. The points are generated with 
a uniform distribution along each axis. Then we 
run the clustering analysis using the same proce- 
dure described before, varying also the volume of 
the Monte Carlo generated points in the cluster- 
ing space, and the percentage of objects used for 
defining the cluster's radii. The probability of ob- 
taining a; < w is less then 10% in all cases. Thus 
we may safely reject the possibility of the absence 
of structures. For the shake of completeness, we 
also perform additional tests to see whether our 
initial choice of using a instead of P is indeed a 
good choice in terms of taxonomy. Performing 
clustering analysis using P instead of a, we obtain 
a solution which has w = 20 and is compatible 
with the absence of structures. A possible expla- 
nation of this behavior could be due to the fact 
that P, a and Mg are related. As a result, the use 
of the pair a, Ms is better than the pair P,Ms. 
This further strengthens the case for the use of 
the semi-major axis. 

Moreover, we also tested the solution with respect 
to the presence of observational errors, which can 
be very large in some cases. To do this, we gener- 
ated a random error in the input variables around 
the nominal values for all the EPs. We then re- 
peated the PCA and cluster analysis of these ficti- 
tius samples (one for each of the input variables). 
The solution obtained has been compared with 
the best solution. For errors of a few percent, we 
find that some planets may move to other clusters, 
but this is limited to a few objects. Increasing the 
errors, sometimes a cluster may split into two sub- 
clusters. For errors larger than 10% the solution 
may alter considerably. In general the best solu- 
tion does not change much for errors up to ±10% 
for each of the input variables. This is very im- 
portant, since it means that the solution is quite 
stable, in particular considering that we are deal- 
ing with projected planetary masses and not real 
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masses. 

The process developed for EP taxonomy is sketched 
in fig. H 



4. Analysing the clusters 

Our best solution is composed of five robust 
clusters. In this section we present the properties 
of each cluster. Figure [3] shows the position of 
clusters in the clustering space. We checked for 
inter-correlation among the input variables within 
each cluster. It is commonly accepted that plan- 
ets may form in different ways (core accretion vs 
disk instability), and that their evolution is af- 
fected by several parameters (disk density, stellar 
types, opacity etc). The EP database may reflect 
such complexity, however the signature of these 
processes may be blurred in statistical analyses 
which deal with the whole EP dataset. On the 
contrary, if cluster separation has something to do 
with the formation and evolutionary processes and 
is not just a mere classification, it becomes impor- 
tant to look for trends within each cluster. In the 
following we report only highly significant (with 
a 2-tailed probability less than 5%) intra-cluster 
correlations. 

4.1. Cluster CI 

Containing 11 EPs (see tables [1] and for a de- 
tailed description) this is the least populated clus- 
ter. We can define a prototype planet, that is the 
object closest to the center cl of the cluster. The 
prototype is HD 41004 A b. Figure [5] shows the 
distribution of the input variables for each cluster. 
The EPs of this cluster are characterized by a wide 
range of planetary masses, from about 0.2 AIj to 
18.4 Mj. The planetary semi-major axis ranges 
from 0.018 AU to 1.97 AU. The eccentricities are 
quite spread, from 0.08 to 0.63. The stellar masses 
are remarkably sub-solar (except for HD 8574 b 
which has 1.04 M©). Finally, star metallicity is 
very spread, from -0.5 to 0.28 dex. Cl contains 
several peculiar EPs. HD 41004 B b, HD 162020 b 
HD 114762 b are very massive planets, with 18.4, 
13.7 and 11 Mj respectively (see tabs. [T]and[2). 
The first two are also hot Jupiters (notice that 
HD 162020 b could be a brown dwarf, accord- 



ing to lUdrv et al.ll2002f ). This cluster contains 6 
EPs in multiple star systems (MSS), see tables [1] 



and [2] (dat a on multiple star system s have been 
taken from iDesidera fc Barbieri 2006f ). Moreover 
HD 41004 B b, HD 162020 b, HD 114762 b and 
HD 111232 b orbit low mass, low metallicity, stars. 
Despite this variety, the objects in this cluster have 
to be considered "similar" in terms of clustering 
analysis and "different" to the other EPs. We un- 
derline that the cluster analysis has been done in 
the {pci} space, therefore the input variables may 
vary considerably within a cluster. 
Significant intracluster correlations exist between 
Mp-e and Mp - Ms (see fig. E]). Notice that Mp 
is anticorrelated with Mg . This is somehow unex- 
pected as for higher we would expect a higher 
dust surface den sity for the protoplanetary disks 
(|lda fc Linll2"005h and hence more massive planets 
(consider also that here we have sub-solar stellar 
masses; on this point see also C4)- We may argue 
that this has something to do with the peculiar 
way these EPs formed, but no firm conclusion can 
be drawn yet. 



4.2. Cluster C2 

This cluster contains 46 EPs (see tables [T] and 
[2]). The prototype is HD 69830 c. This cluster is 
characterized by an Mp distribution with an av- 
erage of about 1 Mj and a standard deviation 
of 1.2 Mj, clearly peaked at low masses. Most 
of the planets have masses below 1.3 Mj. The 
semi-major axis distribution is characterized by 
two distinctive groups, one peaked at very small 
a and the second, far less numerous, centered at 
a ~ 1.6 AU. The overall distribution has an av- 
erage of 0.46 AU and a standard deviation of 
0.7 AU. Planetary eccentricities are moderate-to- 
low, 0.11 ± 0.11, and stellar masses are remark- 
ably sub-solar, 0.83 ± 0.2 Mq. Star metallicities 
are around solar, with an average of -0.04 dex, 
and a standard deviation of 0.22 dex (see fig. [5]). 
Gl 581 b, Gliese 876 b-c-d and GJ 436 b which 
orbit low mass stars (respectively with 0.31, 0.32 
and 0.41 M© they are the lowest Mg in the sam- 
ple) also with low metallicity (respectively -0.33, 
-0.12, -0.32 dex) belong to this cluster. 
C2 contains 17 hot Jupiters (that is 37% of its 
members), and 4 EPs belong to MSSs (see tables 
[Hand [2]). It also contains 5 transiting EPs (the 
total number of transiting EPs is 14 -at December 
2006- bu t only 9 are involved in the present anal- 
ysis; see lBurrows et al.l 120061 ). These 5 transiting 
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EPs do not seem to have any particular proper- 
ties, exept to have the highest stellar ages of the 
sample (but we warn that those ages may not be 
well constrained). Finally, contains 13 planets 
in multiple planet systems (MPS). 
The significant intra-cluster correlations are a — e 
and a— [Fe/H] (see fig. [7]). The first one is very in- 
teresting because a is anti-correlated with e. Thus 
planets further away from the stars have higher 
eccentricities. In other words, either the excita- 
tion of e is more effective further away from the 
central star (assuming that planets form in circu- 
lar orbits) and/or e dumping is more effective for 
lower a. This result is consistent with tidal cir- 
cularization of planets with small a, however no- 
tice that at a ^ 0.02 AU the average e is about 
0.1, that is, still considerably non-zero (we caution 
that low e values c ould be affec ted by biases due to 
fitting procedures, lFordll2005h . Moreover, with a 
notably steep trend, a anti-correlates with [Fe/H]. 
Thus either the planetary migration is more pro- 
nounced for high [Fe/H] (which is in agreern ent 
with simulations, e.g. see iLivio fc Pringlell2003h or 
giant planets of this cluster may form close to stars 
(< 1 AU) in high metallicity environments. We 
recall that for [Fe/H]> the m ost accredited for- 
mation theory is core acc retion (jFischer fc Valenti 
20051 : ISantos et all l2005h . Notice also that the 
distribution of the MPS planets in the a— [Fe/H] 
plane is somehow opposite to the overall observed 
trend: without them the correlation would be even 
more pronounced. Both trends are very impor- 
tant because, for the first time, they show sig- 
nificant dependence between metallicity and or- 
bital parameters (on the possibl e existence of a 
period-metallicity correlation see ISozzettill200l . 
These findings also suggest that lower [Fe/H] plan- 
ets tend to have larger or bits, makin g them dif- 
ficult to detect (e.g. see iBosd l2002l ). In turn, 
this may affect the probability of f orming plan- 



ets with respect to the meta llicity (jSantos et al 
20041 : iFischer fc Valentil[2005h . 



4.3. Cluster C3 

Containing 48 EPs (see tables [T] and H]) this 
cluster, along with C4-, is the most populated. 
The prototype is HD 11964 A b, and it contains 
Jupiter. This cluster is characterized by an Mp 
distribution peaked at low masses: basically all 
EPs are below 2.5 Mj. The semi-major axis dis- 



tribution is very peaked at low values: most of 
the bodies are below 0.25 AU, with a second, less 
numerous, peak at about 1 AU. Eccentricities are 
below about 0.3, having an average of 0.096 and a 
standard deviation of 0.091. Stellar masses span 
from 0.98 to 1.24 M©. Metallicities tend to be 
super-solar and vary from 0.02 to 0.35 dex (see fig 
El). 

The significant intra-cluster correlations are: Mp~ 
Ms, Mp-[Fe/H], a-Ms, a-[Fe/H], e-[Fe/H], and 
Mg — [Fe/H]. However the first three are due to out- 
liers: without them the correlations do not hold. 
Therefore they are not considered interesting. On 
the contrary, the latter three -all involving the 
metallicity- are robust (see fig. [5]). First of all, 
a is anti-correlated with [Fe/H]. Thus either the 
planetary migration is more pronounced for high 
[Fe/H] (see also C2) or planets with lower [Fc/H] 
form at larger distances. A striking result is that 
Jupiter fits very well in this cluster: its large a 
would be the result of its formation in a solar-like 
metallicity disk. Moreover, e is anti-correlated 
with [Fe/H]. In other words, higher metallicities 
correspond to lower eccentricities. Thus either the 
excitation of e is more effective -or dumping is less 
effective- for lower metallicities. In order to under- 
stand the meaning of these correlations, two fur- 
ther points have to be considered. First, the exis- 
tence of a— [Fe/H] and e— [Fe/H] correlations does 
not imply a correlation between a and e. Indeed 
they are not correlated to the level of confidence 
adopted here. Moreover, the members of this 
cluster have very super-solar metallicities. These 
correlations, if related to the formation processes, 
may give important indications about the origin of 
high ec centricities. Many th e ories have been pro- 
posed dHolman et al.l 1 19971: iMurrav et all 120021: 



Goldreich fc Saril 



Zakamska fc Tremaind 



Adams fc Laughhnl l20o'6[ 



2006h but 



20041: iNamounil I200E 
D'Angelo et al.l 120061 : iFregeau et al 
none has observational support yet. Here we 
suggest that the metallicity acts in some way in 
determining e. For instance, since high [Fe/H] 
produces a faster migration, the low values of e 
observed for high [Fe/H] may be the result of 
the migr ation process, e.g . tida l circularitazion 
(see also Halbwachs et al.l |2005[) . Alternatively 
the condition for the p umping up of e during 
planet-disk interactions (ISari fc GoldreichI 12004 
Matsumura fc Pudritd l2006h is not achieved in 
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high [Fe/H] environments: indeed higher [Fe/H] 
imphes higher disk viscosity and hence a lower 
pr obabihty of e excitatio n (see equ. 8 and 9 
of ISari fc GoldreichI 120041 ). If confirmed, these 
trends suggest that planetary orbital parame- 
ters are mainly controlled by disk properties (e.g. 
metallicity) rather than being affected by external 
factors, like perturbation due to interactions with 
a companion star or star encounters. 
Finally, Mg is anti-correlated with [Fe/H]. It is not 
easy to understand this correlation, and in partic- 
ular if it has something to do with the planetary 
formation process. We note that EPs in this clus- 
ter (which are quite confined in terms of a and Mp) 
orbit stars whose metallicities tend to increase as 
stellar masses decrease. If we assume that higher 
Ms implies higher proto-planetary disks masses 
from which EPs formed, this implies that to form 
planets in lower metallicity environments a more 
massive disk is required, in agreement with the 
core accretion theory. 

Eleven EPs are in MSSs (see table [2|). It contains 
also 23 hot Jupiters (48% of the members) and 
8 MPS planets. They are well spread and seem 
not to affect the a— [Fe/H], e— [Fe/H] correlations. 
However, all MPS planets and most of the hot 
Jupiters lie above the linear fit in the M^ — [Fe/H] 
plane. Notice also the MPS planets of this cluster 
have e < 0.2 and [Fe/H]> 0.2 and for this reason 
they differ from those oi C2 (which have e < 0.3, 
[Fe/H]< 0.2) and those of (which have e > 0.3). 

4.4. Cluster C4 

This cluster contains 48 EPs (see tables [T] and 
12). The prototype is HD 142022 A b. C4 is 
characterized by rather flat distributions of the 
input variables (see fig. E]). In terms of plan- 
etary masses, it contains very massive bodies, 
basically all having Mp > 2.0 Mj. Apart from 
very few exceptions, it contains all the EPs of the 
dataset having mass greater than 5 Mj. The 
average and standard deviations are 5.45 and 
3.92 Mj, respectively. The semi-major axis distri- 
bution is also quite flat, ranging from about 0.5 to 
5 AU. Mean and standard deviations are 1.98 and 
1.27 AU, respectively. Eccentricities are remark- 
ably moderate-to-high, spanning from 0.3 to 0.8 
(mean and standard deviation are 0.49 and 0.18, 
respectively). Stellar masses are mainly around 
one solar mass, with a slight overabundance of 



super-solar mass objects. Mean and standard 
deviations are 1.06 and 0.13 Mq, respectively. 
Metallicities span from -0.25 to 0.3 dex, having 
an average and standard deviations of 0.14 and 
0.17 dex, respectively. Thus despite its wide dis- 
tribution of metallicities, EPs are mainly super- 
solar. 

Two significant correlations exist for this cluster: 
Mp - e and Mp - Mg (see fig. H]). The first im- 
plies that lower mass EPs have higher e, thus the 
mechanisms for the pumping-up of the eccentricity 
are more active in low mass planets, at least for 
the high semi-major axes and moderate positive 
metallicities of this cluster. Moreover, EP masses 
are correlated with stellar masses. This may con- 
firm the fact that higher Ms implies larger proto- 
plan etary disk surfa ce density and hence larger 
Mp (llda fc Linll2005h . 

C4 contains 12 EPs in MSSs and 12 in MPS (see 
tables [Hand 121). Notice that the MSS planets may 
be responsible of the Mp — e correlation as many 
of them have low Mp and high e. 

4.5. Cluster C5 

This cluster contains 31 EPs (see tables [1] and 
[2), and the prototype is HD 117207 b. Plane- 
tary masses have intermediate values, with a mean 
of 2.16 Mj and a standard deviation of 1.24 Mj, 
respectively. The semi-major axis distribution is 
rather flat spanning from 0.37 to 3 AU, with a few 
bodies around 4 AU. Eccentricities are peaked at 
0.2-0.3, and range from 0.2 to 0.5. Stellar masses 
are super-solar, having mean and standard devi- 
ation of 1.22 and 0.21 Mq, respectively. Stel- 
lar metallicities are also remarkably super-solar, 
having mean and standard deviation of 0.15 and 
0.15 dex (see fig. [5]). 

The formally significant correlations are: Mp — Ms, 
Mp— [Fe/H] and a — e. However they are all due 
to outlier planets and thus cannot be considered 
as real (notice that if we do not consider the out- 
liers, the correlation a — e is very close to the 5% 
significancy level, thus it may become significant 
as more objects are added to this cluster by future 
discoveries). 

It contains 7 EPs in MSSs and only one hot Jupiter 
(namely, HD 118203 b). It is interesting to un- 
derstand why this hot Jupiter is in this cluster. 
The peculiarity of HD 118203 b is that it has the 
highest eccentricity (0.309) among hot Jupiters. 
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Notice that HD 185269 b (in C3) also has a very 
high eccentricity of 0.3. AU the other input vari- 
ables are the same, except for the planetary mass. 
HD 118203 b with a mass of 2.13 Mp is one of 
the most massive hot Jupiters. This explains why 
HD 118203 b has been put in this cluster. This 
cluster contains 13 EPs in MPS. 
Despite the lack of any correlation, few comments 
may be added to uncover the nature of this clus- 
ter. C-^ and C 5 have some similar traits. They have 
spread and rather flat distributions of input vari- 
ables. Moreover, both have large semi-major axes 
and eccentricities. However, C5 contains objects 
with higher stellar masses, lower eccentricities and 
lower masses than C4- 

5. Discussion and conclusion 

In this paper we develop the basis for an ex- 
trasolar planet taxonomy. We use as many in- 
puts as possible for this analysis, in particular 
the planetary mass, semi-major axis, eccentric- 
ity, stellar mass and stellar metallicity. We iden- 
tify the best procedure to follow: a multivariate 
statistical analysis (PCA) to find the most im- 
portant variables, and then hierarchical cluster 
analysis. We analyze the solutions via canonical 
means (through the cophcnctic coefhcient) and by 
analysis of the distribution of the dendrogram's 
heights. The best result is achieved with non tra- 
ditional metric and merging algorithms, namely 
the Pearson correlation metric and weighted cen- 
troid cluster merging. We reject the absence of 
clustering structure with Monte Carlo simulations, 
and also test the stability of the solution against 
observational errors of the input variables. The 
procedure we followed is able to provide a roboust 
extrasolar planet taxonomy even if the number 
of planets is still low. The general traits of the 
taxonomy developed here will be updated as more 
planets become available. 

Our best solution consists of five clusters. We dis- 
cuss their properties with respect to the physically 
relevant input variables. We show the importance 
of including the environmental variables [M^ and 
[Fe/H]) to discriminate between otherwise similar 
planets; and also to merge together different bod- 
ies (like EP in MSSs and orbiting single stars). For 
instance, we were able to divide the hot Jupiters 
into -at least- 2 main groups (see tab. [J). This 
division is mainly due to the stellar mass and 



metallicity. Those belonging to C3 basically or- 
bit around stars with super-solar masses and high 
metallicities; those of C2 orbit mostly sub-solar 
mass stars with moderate (both positive and neg- 
ative) metallicities. This may reflect differences in 
the formation processes of these EPs. 
Jupiter belongs to cluster C3. Much as been specu- 
lated about the similarit y of our Solar Sy stem and 



extrasolar systems (e.g. iBeer et al 



2004D . in par- 
ticular concerning the formation histories. With 
the help of cluster analysis we may identify those 
EPs which are more similar -in the 3-fold cluster- 
ing space- to Jupiter. We suggest that the actual 
large semi-major axis of Jupiter is the result of its 
formation in a solar-like metallicity disk. 
We also analyzed the intra-cluster correlations, 
since this may provide important information 
about the formation and evolution of bodies 
within a cluster. This is crucial in order to uncover 
information which may be hidden in the "blind" 
statistical analysis performed on the whole EP 
database. The most important correlations found 
are those for C2, C3 and C4 (see tab. [1]). Remark- 
ably, for C2 and C5 we find important trends be- 
tween metallicity and orbital parameters. We find 
that [Fe/H] has very important effects on the semi- 
major axis (and thus on the migration processes) 
and the eccentricity. It may also happen that 
the same variables correlate in an opposite way 
between two different clusters (see the Mp — Ms 
correlations for CI and C4)- Moreover, we also 
studied the distribution of planets in multiple star 
systems in each cluster. They do not seem to 
play a particular role in the corresponding cluster 
correlations. Similar considerations apply also for 
multiple planet systems. 

In addition to these main five clusters, we may 
see the position of the pulsar planets in the clus- 
tering space. Obviously these planets were not 
included in the previous analysis because we do 
not have Ms and [Fe/H]. However we may use as 
test values Ms — 10 Mq for the progenitors of 
both PSR 1257-1-12 and PSR B1620-26. As for 
the metallicity we assume and -1.05 (the first is 
an indicative value, the latter is the average for 
M4 stars), respectively. Using the formule (1), 
(2) and (3), we find that these planets are very 
far from all the other EPs in the clustering space, 
and hence for each pulsar we have a single cluster. 
This is consistent with the very likely different ori- 
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gin of the pulsar planets with respect to other EPs. 
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Fig. 1. — Distribution of the data in the 3- fold 
clustering space. 



■ ci 

- C2 
C3 

' C4 
C5 



PC, 




Fig. 2. — Dendrogram of the best solution. 
Jupiter position is also reported. The horizontal 
line corresponds to the best cut (see text). 




Fig. 3. — Distribution of the clusters in the clus- 
tering space projected onto the pci — pc2, pci —pc^ 
and pc2 — pc3 planes. Large pluses indicate the 
prototype planet of each cluster (see text). 
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Fig. 4. Guidelines for the planetary taxonomy developed in this paper. As for the Solar System, only 
Jnpiter has been included in this analysis so far (see text for further detail). Below each cluster the corre- 
ponding prototype is reported. 
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Fig. 6. — Significant correlations within cluster CI. The solid line is the best linear fit. Black circles indicate 
hot Jupiters, while crosses correspond to planets in multiple star systems. 




Fig. 7. — Significant correlations within cluster C2. The solid line is the best linear fit. Black circles indicate 
hot Jupiters, crosses correspond to MSS planets and squares to MPS planets (notice the x-axis log scale). 
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Fig. 9. — Significant correlations within cluster C4- The solid line is the best linear fit. Crosses correspond 
to MSS planets and squares to MPS planets. 
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Table 1 

Relevant data for extrasolar planet clusters. Corr: All significant intracluster 
correlations, i.e. having 2-tailed probability less then 5%. those of c2, c3 and cj^ are the 

MOST IMPORTANT ONES. WE SHOW THEM IN THE FIGS. [H [3 AND [9j HT: HOT JUPITERS. T: 

Transiting planets. MSS: Planets in multiple star systems. MPS: Multiple planetary 

SYSTEMS. SS: Solar system planets. 



Cluster 



Prototype 



Members 



Corr 



HJ 


T 


MSS 


MPS 


SS 


2 




6 






17 


5 


4 


13 




23 


4 


11 


8 


Jupiter 






12 


12 




1 




7 


13 





CI 
C2 
C3 
C4 
C5 



HD 41004 A b 
HD 69830 c 
HD 11964 b 
HD 142022 A b 
HD 117207 b 



11 

46 
48 
48 
31 



Mp - e, Mp - Ms 
a - e, a-[Fe/H] 
a-[Fc/H], e--[Fe/H], 
Mp - e, Mp - M, 



[Fe/H] 



17 



Table 2 

Supplementary Material: Details on extrasolar planet clusters. For each planet we 

REPORT the mass (Mj), SEMI-MAJOR AXIS (AU), PERIOD (DAYS), ECCENTRICITY, STELLAR MASS 

(Mq), METALLICITY, and THE FIRST THREE PRINCIPAL COMPONENTS. PlANETS IN BOLD ARE 

PROTOTYPES. Planets in multiple star systems, multiple planet systems, transiting, hot 

JUPITERS ARE ALSO INDICATED. 



Planet Mj, a P e Mg [Fe/H] pci pc2 PC3 



Cluster CI 



HD147513bt 


1.000 


1.260 


538.33 


0.520 


0.920 


-0. 


,030 


-0.426 


-0.801 


0.241 


HD162020b'' 


13.750 


0.072 


8.36 


0.277 


0.700 


0, 


.010 


-1.171 


-2.260 


1.806 


HD178911Bbt 


6.292 


0.320 


70.64 


0.124 


0.870 


0, 


.280 


0.459 


-0.017 


1.041 


HD41004Bb'''''t 


18.400 


0.018 


1.33 


0.081 


0.400 


-0, 


,010 


-0.883 


-3.580 


2.373 


HD3651b+ 


0.200 


0.284 


62.19 


0.630 


0.790 


0, 


.050 


0.135 


-0.829 


1.080 


HD2 16770b 


0.650 


0.460 


120.08 


0.370 


0.900 


0, 


.230 


0.646 


0.260 


0.774 


HD8574b 


2.230 


0.760 


237.07 


0.400 


1.040 


-0, 


,090 


-0.221 


-0.714 


-0.034 


HD114762bt 


11.020 


0.300 


65.86 


0.340 


0.820 


-0, 


,500 


-1.201 


-3.693 


0.337 


HDl 11232b 


6.800 


1.970 


1138.84 


0.200 


0.780 


-0, 


,360 


-0.678 


-2.921 


-0.483 


HD41004Abt 


2.300 


1.310 


653.56 


0.390 


0.700 


-0. 


090 


0.120 


-1.780 


0.368 


HD65216b 


1.210 


1.370 


610.27 


0.410 


0.920 


-0, 


,120 


-0.159 


-1.138 


-0.160 


Cluster CS 


rlU iZioU4D 


0.890 


0.320 


66.09 


0. 130 


1 .000 





,160 


1 .306 


0.358 


0.092 


XJTTD 1 /I 1 n^T 
rlir^i4oiUC 


0.951 


0.458 


113.73 


0.281 


0.990 





,230 


0. 727 


0.542 


0.484 


rlij iUzi 110 


0. 140 


0. 149 


2 1 .55 


0.060 


0.9.50 





,180 


1.88.5 


0..37.3 


0.084 


HD76700b'' 


0.197 


0.049 


3.96 


0.130 


1.000 


0, 


,140 


1.594 


0.345 


0.078 


HD209458b'''* 


0.690 


0.045 


3.47 


0.070 


1.010 


0, 


,040 


1.675 


-0.037 


-0.208 


HD2638b'' 


0.480 


0.044 


3.49 


-0.000 


0.930 


0, 


,160 


2.112 


0.228 


0.035 


HD195019bt 


3.670 


0.138 


18.16 


0.014 


1.060 


0, 


,080 


1.114 


0.043 


-0.077 


HD23079b 


2.610 


1.650 


737.31 


0.100 


1.100 


-0, 


,240 


0.124 


-1.075 


-1.245 


HD102195b'' 


0.488 


0.049 


4.12 


0.060 


0.926 


0, 


,096 


1.906 


-0.056 


0.017 


HD330075b'' 


0.760 


0.043 


3.34 


-0.000 


0.950 


0, 


,030 


2.010 


-0.243 


-0.243 


HD4208b 


0.800 


1.670 


817.08 


0.050 


0.930 


-0. 


,240 


0.966 


-1.432 


-1.206 


HD27894b 


0.620 


0.122 


17.97 


0.049 


0.750 


0, 


.300 


2.211 


0.192 


0.713 


HD4308b 


0.047 


0.114 


15.43 


-0.000 


0.830 


-0. 


,310 


2.314 


-1.889 


-0.822 


HD37124d* 


0.600 


1.640 


803.93 


0.140 


0.910 


-0, 


,320 


0.747 


-1.820 


-1.183 


HD69830C* 


0.038 


0.186 


31.59 


0.130 


0.860 


-0. 


050 


1.795 


-0.822 


-0.129 


HD37124b* 


0.610 


0.530 


147.69 


0.055 


0.910 


-0, 


,320 


1.638 


-1.760 


-0.970 


HD69830b'''* 


0.033 


0.079 


8.66 


0.100 


0.860 


-0, 


,050 


1.957 


-0.809 


-0.148 


HD189733b'''''* 


1.150 


0.031 


2.26 


-0.000 


0.800 


-0, 


,030 


2.205 


-0.973 


-0.066 


OGLE-TR-113b'''''' 


1.320 


0.023 


1.43 


-0.000 


0.780 


0, 


.140 


2.223 


-0.381 


0.338 


HD192263b 


0.720 


0.150 


23.86 


-0.000 


0.790 


-0, 


,200 


2.236 


-1.638 


-0.476 


TrES-l'*'* 


0.610 


0.039 


3.05 


0.135 


0.870 


0, 


.001 


1.725 


-0.636 


0.067 


HD117618b 


0.190 


0.280 


52.81 


0.390 


1.050 


0, 


,040 


0.484 


0.013 


0.174 


HD130322b 


1.080 


0.088 


10.72 


0.048 


0.790 


-0, 


,020 


2.044 


-0.976 


0.034 


HD101930b 


0.300 


0.302 


70.46 


0.110 


0.740 


0, 


,170 


1.980 


-0.348 


0.484 


HD168746b'' 


0.230 


0.065 


6.31 


0.081 


0.920 


-0, 


,070 


1.878 


-0.712 


-0.306 


OGLE-TR-lllb'''' 


0.530 


0.047 


4.11 


-0.000 


0.820 


0, 


,120 


2.298 


-0.272 


0.148 


HD69830d* 


0.058 


0.630 


196.95 


0.070 


0.860 


-0, 


,050 


1.755 


-0.817 


-0.381 


rhoCrBb 


1.040 


0.220 


38.65 


0.040 


0.950 


-0, 


190 


1.705 


-1.148 


-0.658 


HD37124C* 


0.683 


3.190 


2180.82 


0.200 


0.910 


-0, 


,320 


-0.316 


-1.894 


-1.577 


HD83443b'' 


0.410 


0.040 


3.29 


0.080 


0.790 


0, 


,330 


2.122 


0.443 


0.769 


HD63454b'' 


0.380 


0.036 


2.79 


-0.000 


0.800 


0, 


,110 


2.371 


-0.360 


0.153 


HD114729bt 


0.820 


2.080 


1135.74 


0.310 


0.930 


-0. 


,220 


-0.148 


-1.455 


-0.831 


HD93083b 


0.370 


0.477 


143.79 


0.140 


0.700 


0, 


.150 


1.840 


-0.570 


0.515 


HD6434b 


0.480 


0.150 


21.21 


0.300 


1.000 


-0, 


,520 


0.856 


-2.327 


-0.987 


G1581b'' 


0.056 


0.041 


5.45 


-0.000 


0.310 


-0, 


,330 


3.299 


-3.568 


0.071 


Gliese876d'''''* 


0.023 


0.021 


1.94 


-0.000 


0.320 


-0, 


,120 


3.312 


-2.710 


0.490 


G186bt 


4.010 


0.110 


14.96 


0.046 


0.790 


-0, 


,240 


1.421 


-2.081 


-0.168 


HD99492bt 


0.109 


0.123 


17.88 


0.254 


0.780 


0, 


.360 


1.564 


0.494 


1.106 


GJ436b'' 


0.067 


0.028 


2.64 


0.207 


0.410 


-0, 


,320 


2.414 


-3.291 


0.293 


Gliese876b* 


1.935 


0.208 


61.00 


0.025 


0.320 


-0, 


,120 


2.732 


-2.882 


0.645 


HIP14810b'''* 


3.840 


0.069 


6.67 


0.148 


0.990 


0, 


.230 


0.796 


0.359 


0.633 


HD47536b 


4.960 


1.610 


767.70 


0.200 


0.940 


-0, 


,680 


-0.418 


-3.520 


-1.469 
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Table 2 — Continued 



Planet 




a 


P 




e 


M,, 


[Fo/H] 


P 


ci 


PC2 


PC3 


Glioso876c* 


0.560 


0.130 


30.24 


0, 


,270 


0.320 


-0, 


120 


2 


.220 


-2.848 


0.987 


HD150706b 


1.000 


0.820 


279.61 


0, 


,380 


0.940 


-0. 


130 





.248 


-1.072 


-0.107 


HD128311b* 


2.180 


1.099 


469.89 


0, 


,250 


0.800 


0, 


,080 





.567 


-0.742 


0.352 


HD128311C* 


3.210 


1.760 


951.71 


0, 


.170 


0.800 


0, 


,080 


0, 


.270 


-0.820 


0.083 


Cluster C3 


rlUiooUiob ' 


1.260 


1.190 


456.01 


0. 


150 


1.080 


0, 


290 





.552 


1.051 


0.003 


T T71 rin -1 o o'U 1^ ^ t 

OCjLh.- i K-lo2b ' 


1.190 


0.031 


1.68 


-0. 


,000 


1.350 


0. 


430 


1 


.225 


2.522 


-0.074 


rlJJio4yo 1 D 


1.580 


0.780 


245.38 


0. 


,240 


1.050 


0. 


,230 





.452 


0.679 


0.257 




0.710 


0.233 


35.61 


0. 


,050 


1.330 


0. 


,343 


1 


.074 


2.136 


-0.239 


Jupiter 


1 nnn 






u. 


,U4c5 


1 nnn 
l.uuu 


n 

u. 


nnn 
,uuu 


-1 


noQ 


n QQ7 


1 OQn 
-i.you 


■HT\ ?Q t; Oft 1 


O KC\C\ 

z.ouu 


1 C\KC\ 




u. 


1 /in 

. 14U 


1 non 


n 

u. 




n 
u 


til ft 

.010 


n 7QO 


n 007 


GammaCepheib^ 


1.430 


1.940 


one f\'7 


0. 


.200 


1.180 


0. 


,000 


-0 


.262 


0.168 


-0.911 


rlUicSu^oy D 


u.y4:U 


U.U 1 / 


ft on 
o.yu 


u. 


.oUU 


1 oftn 


n 
u. 


1 1 n 


n 
u 


.OOO 


n Q7n 
u.y ( u 


n 1 1 1 

-U.l 11 




2 000 


3 300 


2187 59 


Q 


100 


1 000 


Q 


160 


-0 


437 


187 


-0 842 


rlUi4yu^OD 


0.360 


0.042 


2.76 


-0. 


.000 


1.300 


0. 


,360 


1 


.476 


2.162 


-0.209 


riU^Uo4o/ D 


0.450 


0.490 


109.86 


0. 


,320 


1.300 


-0. 


,060 





.093 


0.387 


-0.641 


HD27442b ' 


1.280 


1.180 


427.19 


0. 


.070 


1.200 


0. 


,200 





.602 


1.093 


-0.531 


HD88133b'' 


0.220 


0.047 


3.40 


0. 


,110 


1.200 


0. 


,340 


1 


.308 


1.750 


0.107 


BD-103166b'' 


0.480 


0.046 


3.43 


0, 


,070 


1.100 


0, 


,500 


1 


.584 


2.061 


0.564 


HD68988b'' 


1.900 


0.071 


6.30 


0. 


140 


1.200 


0, 


240 





.840 


1.208 


0.098 


HD52265b 


1.130 


0.490 


117.80 


0, 


290 


1.130 


0, 


110 





.377 


0.484 


0.014 


wpi 1 n7i /I sv-i 






/I S 1 


n 
u. 


050 


1 120 


n 
u. 


'^1 /I 

014 




.000 


1 416 


012 


WFlinQ7/1QV-i'^'t 


280 


U.UD4 


O.OO 


u. 


m n 

UlU 


1 200 


Q 


250 




622 


1 425 


n 

-U..iOo 


WF191 71 nTV-i'^' i 


1 330 


U.U / o 


7 1 
i . lo 


u. 


, loz 


1 020 


n 
u. 






320 


1 213 


n Ri /I 

U . D 1 4 


UT^i /ini ,iqk'^ 
rlij 149 14oD 


1.330 


0.053 


4.05 


0. 


,016 


1.210 


0, 


260 


1 


.374 


1 .407 


-0. 146 


tn~\ o n o 1 ^ 

rlU8DU8ib 


1.500 


0.039 


2.56 


0. 


,008 


1.210 


0, 


257 


1 


.373 


1.384 


-0. 147 


HD160691ct 


0.522 


0.921 


310.59 


0, 


,067 


1.080 


0, 


,280 


1 


.134 


1.109 


-0.145 


UpsAndb''' + 't 


0.690 


0.059 


4.59 


0, 


,029 


1.300 


0, 


130 


1 


.285 


1.222 


-0.608 


HD 104985b 


6.300 


0.780 


205.04 


0, 


,030 


1.500 


-0, 


,350 


-0 


.661 


-0.530 


-1.680 


HD179949b'' 


0.950 


0.045 


3.08 


0, 


,022 


1.280 


0, 


,220 


1 


.305 


1.495 


-0.372 


55Cnce'''t't 


0.045 


0.038 


2.67 


0, 


,174 


1.030 


0. 


,290 


1 


.437 


1.024 


0.404 


HD75289b'' 


0.420 


0.046 


3.52 


0, 


.054 


1.050 


0, 


,290 


1, 


.729 


1.094 


0.184 


HD164922b 


0.360 


2.110 


1154.49 


0, 


,050 


0.940 


0, 


,170 





.827 


0.229 


-0.562 


HD73256b'' 


1.870 


0.037 


2.53 


0, 


.030 


1.050 


0, 


,290 


1, 


.517 


0.983 


0.274 


HD99109b 


0.502 


1.105 


439.84 


0, 


,090 


0.930 


0, 


,315 


1, 


.235 


0.772 


0.169 


HDl 14783b 


0.990 


1.200 


500.34 


0, 


,100 


0.920 


0, 


,330 


1, 


.068 


0.754 


0.248 


OGLE-TR-lOb'*'* 


0.630 


0.042 


2.85 


-0, 


.000 


1.180 


0, 


,120 


1, 


.624 


0.829 


-0.470 


HD160691d'''* 


0.044 


0.090 


9.49 


-0, 


.000 


1.080 


0, 


,280 


1, 


.911 


1.195 


-0.034 


55Cncbt'* 


0.784 


0.115 


14.03 


0, 


,020 


1.030 


0, 


290 


1 


.770 


1.012 


0.168 


HAT-P-lb'*-' 


0.530 


0.055 


4.46 


0, 


,090 


1.120 


0, 


130 


1 


.440 


0.661 


-0.197 


47Umab 


2.600 


2.110 


1101.77 


0, 


,049 


1.030 


0, 


,060 





.198 


-0.110 


-0.747 


HD20367b 


1.070 


1.250 


500.32 


0, 


,230 


1.040 


0, 


100 





.346 


0.170 


-0.212 


HD 10647b 


0.910 


2.100 


1074.16 


0. 


180 


1.070 


-0. 


,030 





.025 


-0.243 


-0.916 


HD11964b+ 


0.110 


0.229 


37.74 


0. 


150 


1.125 


0. 


170 


1. 


220 


0.843 


-0.110 


HD108874b 


1.360 


1.051 


393.31 


0, 


,070 


1.000 


0, 


140 


1 


.016 


0.239 


-0.255 


HD16141b+ 


0.230 


0.350 


75.62 


0, 


210 


1.000 


0, 


,220 


1 


.156 


0.620 


0.290 


51Pcgb'' 


0.468 


0.052 


4.21 


-0, 


,000 


1.060 


0, 


160 


1 


.873 


0.629 


-0.196 


HD212301b"'' 


0.450 


0.036 


2.43 


-0, 


,000 


1.050 


0, 


180 


1 


.905 


0.679 


-0.134 


HD187123b'' 


0.520 


0.042 


3.05 


0, 


,030 


1.060 


0, 


160 


1 


.765 


0.615 


-0.134 


HD46375b'''t 


0.249 


0.041 


3.03 


0, 


,040 


1.000 


0, 


,250 


1 


.903 


0.802 


0.150 


HD49674b'' 


0.110 


0.057 


4.94 


0, 


160 


1.000 


0, 


,250 


1 


.513 


0.773 


0.348 


HD190360ct'* 


0.057 


0.128 


17.07 


0, 


,010 


0.960 


0. 


,240 


2 


.070 


0.663 


0.100 


HD196885b 


1.840 


1.120 


383.91 


0. 


,300 


1.270 


-0, 


,390 


-0, 


.434 


-1.127 


-1.387 


Cluster 


HD45350b 


1.790 


1.920 


961.39 


0, 


,778 


1.020 


0, 


,290 


-1 


.989 


0.590 


1.043 



19 



Table 2 — Continued 



Planet 


Mp 


a 


P 


e 


Ms 


[Fo/H] 


pci 


PC2 


PC3 



HD74156ct 


6, 


,170 


3, 


,400 


2228, 


,53 


0.583 


1 


.050 





,130 


-3 


,093 


-0 


,286 


0, 


,217 


HD142415b 


1 


,620 


1, 


,050 


386, 


,94 


0.500 


1 


.030 





,210 


-0 


,556 


0, 


,440 





,632 


HD50554b 


4, 


,900 


2, 


,380 


1276, 


,02 


0.420 


1 


.100 





,020 


-1 


,820 


-0, 


,374 


-0, 


,168 


HD142022Ab+ 


4. 


400 


2. 


800 


1716. 


36 


0.570 


0. 


990 


0. 


190 


-2. 


245 


-0. 


067 


0. 


461 


HD30177b 


9, 


.170 


3, 


,860 


2829, 


,03 


0.300 





.950 


0, 


,250 


-2, 


,803 


-0, 


,291 





.250 


HD136118b 


11, 


,900 


2, 


,300 


1138, 


,96 


0.370 


1, 


.240 


-0, 


,065 


-3 


,307 


-0 


.836 


-0, 


.024 


HD39091b 


10, 


,350 


3, 


,290 


2069, 


,02 


0.620 


1, 


.100 





.090 


-4, 


,114 


-0, 


,644 


0, 


.525 


HD4203b 


1, 


,650 


1, 


,090 


403, 


,43 


0.460 


1, 


.060 





.220 


-0, 


.501 


0, 


,582 


0, 


.518 


HD89744bt 


7, 


,990 


0, 


,890 


258, 


,49 


0.670 


1, 


.400 





.180 


-3 


.038 





.883 


0, 


.853 


14Herb 


4, 


,640 


2, 


,770 


1770, 


,69 


0.369 





.900 


0, 


.430 


-1 


.411 





.645 





.784 


55Cncct't 


0, 


,217 


0, 


,240 


42, 


.31 


0.440 


1 


.030 


0, 


.290 


0, 


.383 


0, 


.914 





.830 


HD 190228b 


4, 


,990 


2, 


,310 


1122, 


.69 


0.430 


1 


.300 


-0, 


.240 


-2 


.217 


-0, 


.786 


-1 


.005 


HD38529ct't 


12, 


,700 


3, 


,680 


2177, 


.65 


0.360 


1 


.390 





.313 


-4 


.434 


1 


.004 


0, 


.094 


HD160691C* 


3, 


,100 


4, 


,170 


2988, 


.87 


0.570 


1 


.080 





.280 


-2 


.879 





.630 


-0, 


,078 


HD122430b 


3, 


,710 


1, 


,020 


318, 


.75 


0.680 


1 


.390 


-0, 


.050 


-2 


.259 


0, 


.297 


-0, 


,013 


HD92788b 


3, 


,860 


0, 


,970 


338, 


.35 


0.270 


1 


.060 





.240 


-0 


.240 


0, 


.544 





,456 


HD213240bt 


4, 


,500 


2, 


,030 


954, 


.79 


0.450 


1 


.220 





.230 


-1 


.855 





.853 


0, 


,188 


HD217107C* 


2, 


,500 


4, 


,410 


3345, 


.49 


0.537 


1 


.020 





.370 


-2 


.658 





.851 


0, 


,021 


55Cncd+'* 


3, 


,920 


5, 


,257 


4330, 


.22 


0.327 


1 


.030 


0, 


.290 


-2, 


.716 





.496 


-0, 


,690 


HD66428b 


2, 


,820 


3, 


,180 


1932, 


.63 


0.465 


1 


.146 


0, 


.310 


-2, 


.044 


1 


.039 


-0, 


,019 


HD 10697b 


6, 


,120 


2, 


,130 


1079, 


.77 


0.110 


1, 


.100 


0, 


.150 


-0 


.867 


0, 


.147 


-0, 


.265 


HDl90360b1''* 


1, 


,502 


3, 


,920 


2891, 


.20 


0.360 





.960 





.240 


-1 


.481 





.313 


-0, 


.389 


HD 183263b 


3 


,690 


1, 


,520 


631, 


.87 


0.380 


1, 


.170 


0, 


.300 


-1 


.076 


1, 


.079 


0, 


.389 


HD23596b 


7, 


,190 


2, 


,720 


1450, 


.07 


0.314 


1, 


.270 





.320 


-2, 


.403 


1, 


.162 


0, 


.057 


HD12661b* 


2, 


,300 


0, 


,830 


266, 


.74 


0.350 


1, 


.070 





.293 


-0 


.131 





.890 


0, 


.598 


16CygBbt 


1, 


.680 


1, 


,680 


798, 


.74 


0.689 


0, 


.990 





.080 


-1 


.492 


-0, 


.280 


0, 


.572 


HD2039b 


4, 


,850 


2, 


,190 


1193, 


.00 


0.680 





.980 





.100 


-2 


.371 


-0, 


.507 


0, 


.731 


HD80606bt 


3 


.410 


0, 


,439 


111, 


.79 


0.927 





.900 





.430 


-1 


.800 





.630 


2, 


.441 


HD202206bt 


17, 


.400 


0, 


,830 


255, 


.72 


0.435 


1 


.150 





.370 


-3 


.670 





.160 


2, 


.124 


HD33564b 


9, 


.100 


1, 


,100 


375, 


.61 


0.340 


1 


.250 


-0, 


.120 


-1 


.999 


-0, 


.743 


-0, 


.066 


HD20782bt 


1 


.800 


1, 


,360 


578, 


.82 


0.920 


1 


.000 


-0, 


.050 


-2 


.159 


-0 


.836 


0, 


.817 


HD222582bt 


5, 


.110 


1, 


,350 


571, 


.55 


0.760 


1 


.000 


-0, 


.010 


-2 


.285 


-0 


.900 


0, 


.912 


HD28185b 


5, 


.700 


1, 


,030 


382, 


.70 


0.070 





.990 





.240 





.160 


0, 


.242 


0, 


.365 


HD106252b 


6, 


.810 


2, 


,610 


1498, 


.43 


0.540 


1 


.050 


-0, 


.160 


-2 


.669 


-1 


.439 


-0, 


.141 


HD168443C* 


16, 


.870 


2, 


,840 


1769, 


.45 


0.222 





.960 


0, 


.100 


-3, 


.595 


-1, 


.429 


0, 


.809 


HIP75458b 


8, 


.820 


1, 


,275 


511, 


.15 


0.712 


1, 


.050 





.030 


-2 


.934 


-0, 


.878 


1, 


.179 


HD141937b 


9, 


.700 


1, 


,520 


681, 


.35 


0.410 


1, 


.000 





.160 


-2 


.115 


-0, 


.501 


0, 


.991 


HD33636b 


9, 


.280 


3, 


,560 


2454, 


.89 


0.530 


0, 


.990 


-0, 


.130 


-3 


.547 


-1, 


.735 


-0 


.083 


HD74156b* 


1 


.860 


0, 


,294 


56, 


.78 


0.636 


1 


.050 


0, 


.130 


-0, 


.702 


0, 


.145 


0, 


.947 


EpsilonEridanib 


1 


.550 


3, 


,390 


2500, 


.25 


0.702 





.830 


-0, 


.100 


-2 


.158 


-1 


.523 


-0, 


.070 


HD210277b 


1 


.230 


1, 


,100 


439, 


.06 


0.472 





.920 


0, 


.190 


-0 


.208 


0, 


.063 





.682 


70Virb 


7, 


.440 


0, 


,480 


115, 


.45 


0.400 


1 


.100 


-0, 


.030 


-1 


.246 


-0, 


.716 





.545 


HD37605b 


2, 


.300 


0, 


,250 


50 


.98 


0.677 





.800 





.390 


-0 


.435 


0, 


.346 


2, 


.048 


HD168443bt 


7, 


.480 


0, 


,290 


58 


.00 


0.530 





.960 





.100 


-1 


.332 


-0 


.679 


1 


.357 


GJ3021bt 


3, 


.320 


0, 


,490 


131 


.83 


0.505 





.900 


0, 


.200 


-0, 


.383 


-0 


.125 


1 


.185 


HD154857b 


1 


.800 


1, 


,110 


394, 


.62 


0.510 


1 


.170 


-0, 


.230 


-0, 


.945 


-0, 


.874 


-0, 


.506 


HD81040b 


6, 


.860 


1, 


,940 


1003, 


.92 


0.526 


0, 


.960 


-0, 


.160 


-2 


.103 


-1, 


.695 


0, 


.216 



Cluster C5 



HD160691bt 


1.670 


1.500 


645.23 


HD202206C* 


2.440 


2.550 


1385.58 


HD11977b 


6.540 


1.930 


707.49 


UpsAndd^'* 


3.950 


2.510 


1272.09 


HD196050bf 


3.000 


2.500 


1374.86 


HD40979bt 


3.320 


0.811 


256.33 


HD187085b 


0.750 


2.050 


970.36 


HD33283b 


0.330 


0.168 


22.58 


HD108147b 


0.400 


0.104 


10.87 



0.310 


1 


.080 


0.280 


-0.248 


0.915 


0.205 


0.267 


1 


.150 


0.370 


-0.951 


1.403 


-0.085 


0.400 


1 


.910 


-0.210 


-3.337 


1.104 


-1.800 


0.242 


1 


.300 


0.130 


-1.444 


0.808 


-0.739 


0.280 


1 


.100 


0.000 


-1.017 


-0.251 


-0.671 


0.230 


1, 


.080 


0.194 


0.054 


0.488 


0.258 


0.470 


1, 


.220 


0.050 


-1.174 


0.450 


-0.491 


0.480 


1, 


.240 


0.366 


-0.116 


1.839 


0.725 


0.498 


1, 


.270 


0.200 


-0.223 


1.270 


0.390 
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Table 2 — Continued 



Planet 


Mp 


a 


P 


e 


M, 


[Fe/H] 


pci 


PC2 


PC3 


HD169830C* 


4.040 


3.600 


2105.72 


0.330 


1.400 


0, 


210 


-2.532 


1.360 


-0.941 


HD177830b 


1.280 


1.000 


337.51 


0.430 


1.170 


0. 


,000 


-0.489 


0.101 


-0.185 


HD82943C* 


2.010 


0.746 


219.28 


0.359 


1.150 


0, 


,270 


-0.204 


1.069 


0.428 


HD216437b 


2.100 


2.700 


1565.15 


0.340 


1.070 


0, 


,000 


-1.091 


-0.295 


-0.657 


HD12661C* 


1.570 


2.560 


1445.35 


0.200 


1.070 


0, 


293 


-0.407 


0.948 


-0.306 


HD89307b 


2.730 


4.150 


2737.38 


0.270 


1.270 


-0, 


,230 


-2.148 


-0.653 


-2.027 


HD73526b* 


2.900 


0.660 


193.66 


0.190 


1.020 


0, 


,280 


0.474 


0.694 


0.481 


HDlir207b 


2.060 


3.780 


2629.78 


0.160 


1.040 


0. 


270 


-0.981 


0.702 


-0.730 


HD142bt 


1.000 


0.980 


337.72 


0.380 


1.100 


0, 


,040 


-0.119 


0.083 


-0.089 


HD50499b 


1.710 


3.860 


2456.46 


0.230 


1.270 


0, 


,230 


-1.613 


1.257 


-1.146 


HD169830b* 


2.880 


0.810 


224.83 


0.310 


1.400 


0, 


,210 


-0.711 


1.546 


-0.164 


HD2 16435b 


1.490 


2.700 


1448.61 


0.340 


1.250 


0, 


150 


-1.283 


0.898 


-0.717 


HD19994b+ 


2.000 


1.300 


465.64 


0.200 


1.350 


0, 


,230 


-0.327 


1.566 


-0.472 


HD82943b* 


1.750 


1.190 


441.84 


0.219 


1.150 


0, 


,270 


0.087 


1.124 


0.008 


HD38529bt'* 


0.780 


0.129 


14.35 


0.290 


1.390 


0, 


,313 


0.185 


2.120 


0.066 


HR810b 


1.940 


0.910 


300.71 


0.240 


1.110 


0, 


,250 


0.199 


0.908 


0.183 


HD108874C* 


1.018 


2.680 


1601.77 


0.250 


1.000 


0, 


140 


-0.412 


0.158 


-0.498 


UpsAndc*'* 


1.980 


0.830 


242.07 


0.254 


1.300 


0, 


130 


-0.168 


1.017 


-0.342 


HD118203b'' 


2.130 


0.070 


6.09 


0.309 


1.230 


0, 


100 


0.151 


0.676 


0.081 


47Umact 


1.340 


7.730 


7730.20 


-0.000 


1.030 


0, 


,060 


-2.425 


-0.157 


-2.795 


HD62509b 


2.900 


1.690 


587.98 


0.020 


1.860 


0. 


190 


-1.041 


2.954 


-1.817 


HD72659b 


2.960 


4.160 


3174.99 


0.200 


0.950 


-0, 


,140 


-1.372 


-1.282 


-1.390 


^Planets in multiple star systems. 
^Planets in multiple planetary systems. 



^Hot Jupiters. 
^'^Very hot Jupiters. 



^Transiting planets. 
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